On a Problem Posed by Maurice Nivat 

Maxim A. Babenko * 
February 2, 2008 

Abstract 

Consider a m x n matrix A, whose elements are arbitrary integers. 
Consider, for each square window of size 2x2, the sum of the correspond- 
ing elements of A. These sums form a (m — 1) x (n — 1) matrix S. Can 
we efficiently (in polynomial time) restore the original matrix A given S? 

This problem was originally posed by Maurice Nivat for the case when 
the elements of matrix A are zeros and ones. We prove that this problem is 
solvable in polynomial time. Moreover, the problem still can be efficiently 
solved if the elements of A are integers from given intervals. 

On the other hand, for 2x3 windows the similar problem turns out 
to be NP-complete. 

1 Introduction 

By Mmn we denote the set of all n x to integer matrices. For a given matrix 
A S Mmn consider the matrix of sums for all 2 x 2 windows, denoted by S = 
^22{A). Here indices denote the window size. More generally, a mapping Y,m'n' '■ 
Mmn Mm-m' +i,n-n' +1 (for to' X n' wlndow) Is defined in a similar way. 

Now let A be a matrix with 0-1 elements and S = T,22{A). How can we 
restore A knowing 5"? First of all, note that S could have many preimages. (For 
example, consider an arbitrary 0-1 matrix such that every its column is formed 
by alternating zeros and ones. Clearly all elements of S are equal to 2.) So our 
goal is to find (efficiently) one of the preimages of S if they exist. 

We also consider a more general problem with upper constraints. Namely, 
given a matrix 5 of to' x n' sums and upper constraints matrix U G Mmn we 
look for a matrix A such that 

0<A<U, 

As usual, < A < U means that < Aij < Uij for all i and j. 

The original problem (with Uij — 1) is called binary. In this paper we prove 
the following results: 
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Theorem 1 The binary problem with 2x2 window is solvable in polynomial 
time. Moreover, in a typical RAM model it can he solved in 0{mn) time. 

Theorem 2 In a typical RAM model the problem with 2x2 window and upper 
constraints is solvable in 0{mn{m+n){l + Um,in)) time, where Umin = miriij Uij. 

Suppose the elements of U and S are given in unary notation. Then The- 
orem 121 imphes that the binary problem with upper constraints is solvable in 
polynomial time. The next theorem shows the hardness of the similar problem 
for 2x3 window. 

Theorem 3 The problem with upper constraints (given in unary notation) and 
2x3 window is NP- complete. 

2 Binary Problem for 2x2 Window 

Let A be a matrix we are looking for and S be the matrix of sums that is given to 
us. We number the rows and the columns starting from zero (rows 0, . . . , m — 1 
and columns 0, . . . , n — 1). 

Note, that it is sufficient to restore only the elements in the zero row and 
column of A. After that, all other elements are determined uniquely. We start 
with an observation that works not only in the binary case [Aij G {0, 1}), but 
also for any upper constraints (0 < Aij < Uij). 

We may assume that Aqq is already known (since we can consider all C/qo + 1 
possible cases one by one). Let xi, . . . ,a;„_i be the remaining elements of the 
zero row of A and yi, . . . , ym-i be the remaining elements of the zero column: 
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Easy induction shows that 

A^j = {-lYxj + {-iyy, + bij, 

where hij are some constants depending on A^q and matrix S. The numbers 5^ 
can be computed in 0{m,n) time. So we get the following requirements for Xj 
and yi'. 

< Xj < Uoj; 

< y^ < Um; (1) 

-hj < i-iyx. + i-iyy, < U,j-b,j. 

Moreover, if conditions are satisfied for some Xj and yi, then correspond- 
ing matrix A provides a solution for the original problem with upper constraints. 
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Our algorithm uses that each inequahty in ^ depends on at most two 
variables. Suppose we consider the binary case. Then Xj and yi are Boolean 
variables and the inequalities JQ) can be written as a Boolean formula. Indeed, 
for each pair the inequality —bij < {—lyxj + (— < 1 — hj forbids 

some pairs of values {xj,yi). Putting these restrictions together we obtain a 
2-CNF formula in Xj, yi. It is clear that the size of this formula is 0{mn). 

A well-known fact is that for a given 2-CNF formula one can find whether 
it is satisfiable or not in polynomial time (and find a satisfying assignment if it 
exists). This problem is often called 2- SAT problem. Moreover, there exists an 
algorithm solving 2-SAT that runs in linear time (in the length of the formula). 
Our formula is of 0(mn) size, and hence we obtain the proof of Theorem^ 

In the rest of the section we briefly outline the idea behind the linear time 
algorithm for solving 2-SAT problem. Let {zi} be the set of Boolean variables. 
A literal is a variable Zi (denoted by zf) or its negation (denoted by zj). By 
2-CNF we mean a formula (j) in conjunctive normal form where each clause is a 
disjunction of at most two literals. Without loss of generality we may assume 
that every clause has exactly two literals (maybe identical). 

Converting the disjunctions into implications we get: 



where Ci, tt^ G {0, 1}. 

Our first step is to construct a directed graph G = {V,E), where V is the 
set of literals: 



For each implication u ^ v (where u, v are literals) we add arcs u — > w and 
V ^ u (here zf denotes zl'"'). 

To satisfy (j> means to label vertices in this graph by Boolean values in such 
a way that z° and zj get opposite values and there is no arc going from a true 
vertex to a false one. 

The size of the graph is linear in the length of (j). We calculate the strongly- 
connected components of G using depth- first search twice (see P). This requires 
linear time. 

Suppose literals z° and zj (for some i) belong to the same strongly-connected 
component. Then (j) is unsatisfiable since it implies both Zi Zi and z,; Zi. 

On the other hand, if literals z^ and z} are in different components for each i, 
then formula is satisfiable. To show this we perform a topological sort of the 
components. In other words, we assign natural numbers to the components in 
such a way that for each arc going from component Ci to component Cj we 
have i < j. 

Now we describe how to assign Boolean values to variables Zi. Consider a 
pair of literals Zk and Zk. Let Ci be the component containing Zk and Cj be the 
component containing z^. If i < j then we put Zk — false. Otherwise i > j 
since Zk and Zk are in different components. In this case put Zk — true. It 




V = {z^,zl}. 
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remains to prove that these values satisfy the formula 4>, i.e., that no arc goes 

from TRUE to FALSE. 

Suppose the contrary and let u — > ?; be such an arc (here u and v are 
literals). Let Ci denote the component containing u and let Cj be the component 
containing v. Then i < j. Consider vertex u and vertex v. Let Ci' and Cj/ be 
their components. Since u = true and v = false we have i' < i and j' > j, 
hence i' < j'. On the other hand the graph contains the arc v ^ u that violates 
topological order. The correctness of the algorithm is now established. 

It is clear that using appropriate data structures this algorithm can be im- 
plemented in linear time. 

3 The Case of 2 x 2 Window and Arbitrary Up- 
per Constraints 

Now suppose that Aij are integers in the range . . .Uij. We use the fact that 
the problem can be reduced to the set of inequalities 1^. As above, we consider 
each all possibilities for Aqo separately. 

We let Xj — {—lyaj, yi = (— 1)^+^/3^. Then the inequalities ^ become 
two-sided constraints on aj, [3i and the differences aj — f3i: 

L] < a, < U}, 

Ll < A: < Ui, (2) 
4 < < f/g 

for some Lj, Uj, L|, Uf, L^j, Ufj. Consider a more general (and more "uni- 
form") set of inequalities: 

4 < < Ul (3) 

L% < a, -A < U!^. 

These two systems of inequalities are either both consistent or both inconsistent. 
Indeed, every integer solution {aj,f3i) of |(2Jl can be transformed into a solution 
of (O by setting 9 = 0. And visa versa, if {aj,Pi,9) is an integer solution of 
then {aj — 9,f]i — 9) is an integer solution of Thus it is enough to consider 
inequalities l|3Jl only. 

This set of inequalities has a form of difference constraints. Using Ford- 
Bellman algorithm (see T) we may find an integer solution for (|3Jl or establish 
that it does not exist in 0{mn{m + n)) time. 

Namely, suppose we have a set of variables {zi} and a set of difference 
constraints Zi — Zj < Wij for some i,j and integer constants Wij. Our task is 
to find an integer solution (if it exists) for this set of inequalities. To do so, we 
consider a directed graph G ~ {V, E) constructed in the following way. Each 
variable Zi becomes a vertex in V . We also add an auxiliary vertex s to V . For 
each inequality Zi — Zj < Wij we add an arc of length Wij from Zj to Zi . Finally, 
for each i we add an arc s ^ of zero length. 



4 



Clearly, the number of arcs in the resulting graph is linear in the num- 
ber of constraints of the original system of inequalities. We invoke Ford- 
Bellman's shortest-path algorithm starting from the vertex s. This algorithm 
runs in 0{VE) time and either finds a cycle of negative length or computes the 
distances from the origin s to all vertices reachable from s. 

Suppose there is a cycle of negative length in G. Then it cannot pass through 
origin s since it has no incoming arcs. Hence each of the arcs of the cycle corre- 
sponds to some inequality. Summing up these inequalities we get a contradiction 
showing that the set of inequalities is inconsistent. Otherwise let d{u) be the 
distance from the origin s to a vertex u. Then triangle inequality shows that 
the distances d{u) obey all the difference constraints. Moreover, these distances 
are integers (since the lengths Wij are integers). 

The total running time of the algorithm is 0{mn{m -I- rt)(l -I- Uqq)) (recall 
that V = 0{m + n), E = 0{mn) and there are 1 -I- [/qo possible values of ^oo)- 
This time bound can be improved a bit. One may see that instead of Aqq we 
may choose an arbitrary element Aij instead of Aqq thus proving Theorem |21 
The running time is polynomial provided that the elements of U are given in 
unary notation. An open question is if there exists an algorithm whose running 
time is poly(log?7). 



4 NP-completeness of the 2x3 Window Case 
With Upper Constraints 

In this section we prove that the problem for 2x3 windows and upper constraints 
in unary notation is NP-complete. More precisely, consider the following rela- 
tion: 

R = {{A, S,U)\0<A<U, E23(A) = S}. 

Here A, S and U are matrices of any appropriate size. This relation corresponds 
to the language L{R) consisting of pairs (5", U) for which the problem has a 
solution: 

L{R) = {{S, U) I 3A {A, S, U) e R}. 

It is clear that L{R) G NP. We present a Karp reduction from a 3-coloring 
problem to L{R) thus proving the NP-completeness of L{R). 

It is convenient to consider a slightly more general form of the problem by 
imposing two-sided constraints on the elements of matrix A: 

L<A<U; 

^23{A) = S. 

Computationally this problem is not harder than the original one. Indeed, 
let A = L + X, where X € Mmn- Then constraints L < A < U become 
equivalent toO<X < U — L. Thus we have reduced the problem with two- 
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sided constraints to the problem with upper constraints U' and sums 6", where 

U' = U -L; 

S' = S- S23(i). 

The matrices U' and S' can be computed in 0{mn) time. 
Consider a (m + 1) x (3n + 2) matrix A of the form: 
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The properties Aqq = Aqi =0 can be ensured by setting Lqo = Lqi = Uqq = 
Uoi = 0. We put S = and thus all 2 x 3 sums of A are zeros. Then as in the 
case of 2 X 2 windows one may see that for every i,j > 1 

Asi = {-iy+\xi - p,); 
Ai,33+i = i-iy+^iVi-qj); 

A,3j+2 = {-^YiXi +yi + Zj). 

Therefore, any system of two-sided constraints on the values 

•^ij Vji Zj, Pj, Qj, 

Xi — Pj , /gN 

Vi-qj, 

Xi ~\~ yi ~\~ Zj 

may be reduced to L{R). 

Note that these expressions are of some very special form (variables are 
divided into five groups and only some combinations are allowed). However, it 
turns out that any system of two-sided constraints on sums of at most three 
variables can be reduced to this special case. 

Using variables pj , we can represent an equation Xa = x^ (for arbitrary a, /3) 
as follows: 

< Xa — Pj < 0; 
< xp — Pj < 0. 

(we use a "fresh" index j for each equation). Except for that, we do not use 
variables pj . The equations ya = y^ can be expressed in a similar way using qj . 
Now we show how to write an equation Xa = yp for arbitrary a, /?. Again 
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we choose "fresh" indices z, j and k and write 

Vi = 0; 
Xi+yi + Zj ^ 0; 
yp = Vk; 
Xk = 0; 
Xk + Vk + Zj = 0. 

Equation Za ~ z^ becomes 





Xi 


= 0; 


Xi + Hi 
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with "fresh" indices i and j. 

The last issue is an equation Xa — zp. Consider "fresh" indices i, j and 
write 

Xi Xaj 

Xj ~ 0; 
Xj + Vj + -2/3 = 0; 

Z{) = 0; 
x^ + + zo = 0. 

(We may use the same variable zq in all such equations.) Now all variable groups 
Xi, Di and Zj have become fully symmetric and a two-sided constraint may be 
enforced for a sum of arbitrary two or three variables as required. 

Consider an undirected graph G = {V,E). A valid 3- coloring of G assigns 
one of three colors to each vertex of G in such a way that no edge connects 
the vertices of the same color. The graph 3-coloring problem is to find a valid 
3-coloring of G or establish that it does not exist. The corresponding language 

3-COL = {G \ graph G admits a valid 3-coloring } 

is known to be NP-complete (see ^). 

This problem can be stated as an integer program in the following way. 
Assign three integer variables x^, Uv, Zy (corresponding to three possible colors) 
to each vertex of G. Each of variables should be either or 1: 

< Xi, < 1, < ?;„ < 1, < z„ < 1. 
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Since each vertex should be assigned a color 

•^v Vv ~t~ — -1 • 

The requirement that no edge connects the vertices of the same color produces 
the following set of inequalities for each edge uv G E: 

Xii ~j~ ^ 1 , 

Uu + yi, < 1; 

Zu + Zy < 1. 

All these inequalities are constraints on the sum of at most three variables. 
Thus these inequalities are equivalent to some 2x3 problem with two-sided 
constraints. Clearly this reduction can be performed in polynomial time and 
produces matrices L, U and S of polynomial size. Thus we have obtained the 
proof of Theorem 121 
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